Maximum within-cluster association

نویسندگان

  • Yongjin Lee
  • Seungjin Choi
چکیده

This paper addresses a new method and aspect of information-theoretic clustering where we exploits the minimum entropy principle and the quadratic distance measure between probability densities. We present a new minimum entropy objective function which leads to the maximization of within-cluster association. A simple implementation using the gradient ascent method is given. In addition, we show that the minimum entropy principle leads to the objective function of the k-means clustering, and the maximum within-cluster association is closed related to the spectral clustering which is an eigen-decomposition-based method. This informationtheoretic view of spectral clustering leads us to use the kernel density estimation method in constructing an affinity matrix.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Report Level Cluster-to-Track Fusion

In this paper we develop a method for report level tracking based on Dempster-Shafer clustering using Potts spin neural networks where clusters of incoming reports are gradually fused into existing tracks, one cluster for each track. Incoming reports are put into a cluster and continuous reclustering of older reports is made in order to obtain maximum association fit within the cluster and towa...

متن کامل

Expectation Maximisation for Sensor Data Fusion

The expectation maximisation algorithm (EM) was introduced by Dempster, Laird and Rubin in 1977 [DLR77]. The basic of expextation maximisation is maximum likelihood estimation (MLE). In modern sensor data fusion expectation maximisation becomes a substantial part in several applications, e.g. multi target tracking with probabilistic multi hypothesis tracking (PMHT), target extraction within pro...

متن کامل

Genetic Variation within Iranian Iris Species Using Morphological Traits

Iris belongs toIridaceae family and it is monocotyledon. Iris is one of the important ornamental and medicinal plants. 34 iris genotypes (14 species) collected from different provinces of Iran were planted at National Institute of Ornamental Plants (NIOP) Iran. All of the species evaluated for 15 quantitative traits and 30 qualitative traits. Results showed that the highest positive correlation...

متن کامل

Molecular Dynamics Simulation of Al Energetic Nano Cluster Impact (ECI) onto the Surface

On the atomic scale, Molecular Dynamic (MD) Simulation of Nano Al cluster impact on Al (100) substrate surface has been carried out for energies of 1-20 eV/atom to understand quantitatively the interaction mechanisms between the cluster atoms and the substrate atoms. The many body Embedded Atom Method (EAM) was used in this simulation. We investigated the maximum substrate temperature Tmax  and...

متن کامل

Text clustering using frequent itemsets

Frequent itemset originates from association rule mining. Recently, it has been applied in text mining such as document categorization, clustering, etc. In this paper, we conduct a study on text clustering using frequent itemsets. The main contribution of this paper is three manifolds. First, we present a review on existing methods of document clustering using frequent patterns. Second, a new m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Pattern Recognition Letters

دوره 26  شماره 

صفحات  -

تاریخ انتشار 2005